Internship Report: Compositions of Extended Top-down Tree Transducers

Authors

  • Aurélie Lagoutte
  • Andreas Maletti
Abstract

Many aspects of machine translation of natural languages can be formalized by employing weighted finite-state (string) transducers [22, 40]. Successful implementations based on this word- or phrase-based approach are, for example, the AT&T FSM toolkit [41], Xerox's finite-state calculus [24], the RWTH toolkit [23], Carmel [19], and OpenFst [2]. However, the phrase-based approach is not expressive enough, for example, to easily handle the rotation needed in the translation of the English structure NP-V-NP (subject-verb-noun phrase) to the Arabic structure V-NP-NP. A finite-state transducer can only implement this rotation by storing the subject, which might be very long, in its finite memory. Syntax-based (or tree-based) formalisms can remedy this shortcoming. One such formalism is the top-down tree transducer [42, 43], of which a weighted version is implemented in the toolkit Tiburon [38], together with some standard operations. These weighted top-down tree transducers [29, 14, 17] (also called 'tree series transducers') are a joint generalization of the unweighted top-down tree transducer (tdtt) [42, 43] and the weighted tree automaton [7, 10, 1, 27, 16, 9, 8].

During my internship, I investigated compositions of weighted and unweighted extended top-down tree transducers. An unweighted tree transducer computes a relation τ between input and output trees, and a weighted tree transducer computes a weighted relation between input and output trees (i.e., it assigns a weight to each pair of input and output trees). An unweighted tree transducer can be seen as a weighted tree transducer whose weighted relation assigns true (resp. false) to a pair of trees if this pair is (resp. is not) in the relation τ. Using the real numbers as weight structure, we compose two weighted relations τ₁ : A × B → ℝ and τ₂ : B × C → ℝ by requiring that (τ₁ ; τ₂)(a, c) = ∑ ...
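The abstract is cut off at the summation sign, so the exact formula is not shown here. As an illustration only, the sketch below assumes the usual sum-of-products form of weighted relational composition, in which the weight of (a, c) is the sum over all intermediate b of τ₁(a, b) · τ₂(b, c). The function name `compose`, the dictionary encoding of relations, and the `plus`/`times`/`zero` parameters are hypothetical choices made for this example and do not come from the report.

```python
import operator
from collections import defaultdict


def compose(tau1, tau2, plus=operator.add, times=operator.mul, zero=0.0):
    """Compose two finite weighted relations given as {(x, y): weight} dicts.

    Assumed definition (not taken from the truncated abstract): the weight of
    (a, c) is the `plus`-sum over all b of times(tau1[a, b], tau2[b, c]).
    Pairs missing from a dict are treated as having weight `zero`.
    """
    # Index tau2 by its first component so that matching b values are found directly.
    by_b = defaultdict(list)
    for (b, c), w2 in tau2.items():
        by_b[b].append((c, w2))

    result = {}
    for (a, b), w1 in tau1.items():
        for c, w2 in by_b.get(b, []):
            previous = result.get((a, c), zero)
            result[(a, c)] = plus(previous, times(w1, w2))
    return result


# Real-valued weights: sum of products over the intermediate elements.
tau1 = {("a1", "b1"): 0.5, ("a1", "b2"): 0.25}
tau2 = {("b1", "c1"): 2.0, ("b2", "c1"): 4.0, ("b2", "c2"): 1.0}
composed = compose(tau1, tau2)

# Unweighted case viewed as Boolean weights: "or" of "and" is ordinary
# relational composition.
rel1 = {("a", "b"): True}
rel2 = {("b", "c"): True}
composed_bool = compose(rel1, rel2, plus=operator.or_, times=operator.and_, zero=False)
```

Passing the addition, multiplication, and zero element as parameters mirrors the remark in the abstract that an unweighted transducer can be read as a weighted one with true/false weights. The sketch only handles finite relations listed explicitly, whereas relations computed by tree transducers are in general infinite.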


Similar resources

Compositions of Extended Top-down Tree Transducers

Unfortunately, the class of transformations computed by linear extended top-down tree transducers with regular look-ahead is not closed under composition. It is shown that the class of transformations computed by certain linear bimorphisms coincides with the previously mentioned class. Moreover, it is demonstrated that every linear epsilon-free extended top-down tree transducer with regular loo...


Composition Closure of ε-Free Linear Extended Top-Down Tree Transducers

The expressive power of compositions of linear extended top-down tree transducers with and without regular look-ahead is investigated. In particular, the restrictions of ε-freeness, strictness, and nondeletion are considered. The composition hierarchy is finite for all ε-free variants of these transducers except for ε-free nondeleting linear extended top-down tree transducers. The least number o...


Survey: Weighted Extended Top-Down Tree Transducers Part III - Composition

In this survey, (functional) compositions of weighted tree transformations computable by weighted extended top-down tree transducers are investigated. The existing results in the literature are explained and illustrated. It is argued why certain compositions are not possible in the general case, and 3 informed conjectures provide insight into potentially 3 new composition results that extend...


Compositions of Top-down Tree Transducers with ε-rules

Top-down tree transducers with ε-rules (εtdtt) are a restricted version of extended top-down tree transducers. They are implemented in the framework Tiburon and fulfill some criteria desirable in a machine translation model. However, they compute a class of transformations that is not closed under composition (not even for linear and nondeleting εtdtt). A composition construction that composes ε...


Extended Multi Bottom-Up Tree Transducers Composition and Decomposition

Extended multi bottom-up tree transducers are defined and investigated. They are an extension of multi bottom-up tree transducers by arbitrary, not just shallow, left-hand sides of rules; this includes rules that do not consume input. It is shown that such transducers, even linear ones, can compute all transformations that are computed by linear extended top-down tree transducers, which are a th...




Publication date: 2011